Selective Sampling with Drift

Authors

  • Edward Moroshko
  • Koby Crammer
Abstract

Recently there has been much work on selective sampling, an online active learning setting in which algorithms work in rounds. On each round, an algorithm receives an input and makes a prediction. It can then decide whether to query a label; if it does, it updates its model, and otherwise the input is discarded. Most of this work focuses on the stationary case, where a fixed target model is assumed and the performance of the algorithm is compared to that of a fixed model. However, in many real-world applications, such as spam prediction, the best target function may drift over time or shift from time to time. We develop a novel selective sampling algorithm for the drifting setting, analyze it under no assumptions on the mechanism generating the sequence of instances, and derive new mistake bounds that depend on the amount of drift in the problem. Simulations on synthetic and real-world datasets demonstrate the superiority of our algorithm as a selective sampling algorithm in the drifting setting.
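The round-based protocol described in the abstract (receive an input, predict, decide whether to query, update only on a query) can be sketched as follows. This is a minimal illustration and not the authors' algorithm: the query rule is a generic randomized margin-based rule, the update is a plain perceptron step, and the names `make_shifting_stream`, `selective_sampling_run`, and the parameter `b` are hypothetical, chosen only for this example.

```python
import random


def make_shifting_stream(n, seed=0):
    """Hypothetical toy data: 2-D points whose true labeling rule changes halfway."""
    rng = random.Random(seed)
    stream = []
    for t in range(n):
        x = [rng.uniform(-1.0, 1.0), rng.uniform(-1.0, 1.0)]
        # Assumed target model: shifts once, from the first axis to the second.
        u = [1.0, 0.0] if t < n // 2 else [0.0, 1.0]
        score = u[0] * x[0] + u[1] * x[1]
        stream.append((x, 1 if score >= 0 else -1))
    return stream


def selective_sampling_run(stream, dim, b=1.0, seed=0):
    """Run one pass of the selective sampling protocol over (x, y) pairs.

    Labels are +1 or -1. Returns (weights, mistakes, queries).
    """
    rng = random.Random(seed)
    w = [0.0] * dim
    mistakes = 0
    queries = 0
    for x, y in stream:
        # 1. Receive an input and make a prediction.
        margin = sum(wi * xi for wi, xi in zip(w, x))
        y_hat = 1 if margin >= 0 else -1
        # Mistakes are counted for evaluation only; the learner does not
        # see y unless it decides to query.
        if y_hat != y:
            mistakes += 1
        # 2. Decide whether to query: small margins (low confidence) are
        #    queried with higher probability (assumed rule, b > 0).
        if rng.random() < b / (b + abs(margin)):
            queries += 1
            # 3. Update the model only when the label was queried.
            if y_hat != y:
                w = [wi + y * xi for wi, xi in zip(w, x)]
        # Otherwise the input is discarded.
    return w, mistakes, queries


stream = make_shifting_stream(200)
w, mistakes, queries = selective_sampling_run(stream, dim=2)
print("mistakes:", mistakes, "queries:", queries, "rounds:", len(stream))
```

A larger `b` makes the learner query more often, trading label cost for faster adaptation after the target changes. A method designed for drift, as the abstract proposes, would additionally need some mechanism to discount outdated information, which this sketch does not attempt.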

Similar resources

Combining similarity in time and space for training set formation under concept drift

Concept drift is a challenge in supervised learning for sequential data. It describes the phenomenon in which data distributions change over time. In such cases, the accuracy of a classifier benefits from selective sampling for training. We develop a method for training set selection, particularly relevant when the expected drift is gradual. Training set selection at each time step is based on th...

Time-of-arrival estimation by UWB radios with low sampling rate and clock drift calibration

In this paper, we propose a time-of-arrival (TOA) estimation scheme using impulse-radio ultra-wideband (IR-UWB). The scheme features a low sampling rate and is robust against clock drift. Low-rate stroboscopic sampling, which can achieve an equivalent sampling rate as high as the Nyquist sampling rate, is adopted to achieve a high-resolution TOA estimate by IR-UWB. Since a long preamble ...

New Coated Graphite Potentiometric Sensor for Selective Determination of Copper (II) Ions

A highly selective copper (II) coated graphite sensor was prepared by 1,13-bis(8-quinolyl)- 1,4,7,10,13-pentaoxatridecane (kryptofix 5) as a supramolecule ionophore into plasticized polyvinyl chloride (PVC) membrane. The best response characteristic was observed using the membrane composition of PVC = 30.0 mg, dioctyl sebacate (DOS) = 63.5 mg, palmitic acid (PA) = 3.0 mg and kryptofix 5 = 3.5 m...

Asymptotically Optimal Importance Sampling and Stratification for Pricing Path-dependent Options

This paper develops a variance reduction technique for Monte Carlo simulations of path-dependent options driven by high-dimensional Gaussian vectors. The method combines importance sampling based on a change of drift with stratified sampling along a small number of key dimensions. The change of drift is selected through a large deviations analysis and is shown to be optimal in an asymptotic sen...

Particle Identification with a Fine Sampling Ionization

Introduction charge collection) of Q 104. The drift field was chosen to give an electron drift velocity of 1 cm/μsec, ~1/5 the saturated velocity in this gas mixture. The volume of uniform field (adequate for longitudinal drift measurements of dE/dx) extends to within +3 mm of the sense plane and +1 mm of the cathode planes. Hence, in principle, 80% of the track length through a chamber of this...

Publication date: 2014